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Art Unit: 2165 



DETAILED ACTION 



Continued Examination Under 37 CFR 1 1 14 

1 . A request for continued examination under 37 CFR 1.114, including the fee set 
forth in 37 CFR 1 .17(e), was filed in this application after final rejection. Since this 
application is eligible for continued examination under 37 CFR 1.114, and the fee set 
forth in 37 CFR 1 .17(e) has been timely paid, the finality of the previous Office action 
has been withdrawn pursuant to 37 CFR 1.114. Applicant's submission filed on 4- 
December-2006 has been entered. 



2. The Amendment filed on December 8, 2006 has been received and entered. 
Claims 1-88 are pending. 



3. The amendment overcomes the objections are some rejections under 101 . 



Double Patenting 

4. The nonstatutory double patenting rejection is based on a judicially created doctrine grounded in public 
policy (a policy reflected in the statute) so as to prevent the unjustified or improper timewise extension of the 
"right to exclude" granted by a patent and to prevent possible harassment by multiple assignees. A 
nonstatutory obviousness-type double patenting rejection is appropriate where the conflicting claims are not 
identical, but at least one examined application claim is not patentably distinct from the reference claim(s) 
because the examined application claim is either anticipated by, or would have been obvious over, the 
reference claim(s). See, e.g., In re Berg, 140 F.3d 1428, 46 USPQ2d 1226 (Fed. Cir. 1998); In re Goodman, 11 
F.3d 1046, 29 USPQ2d 2010 (Fed. Cir. 1993); In re Longi, 759 F.2d 887, 225 USPQ 645 (Fed. Cir. 1985); In re 
Van Ornum, 686 F.2d 937, 214 USPQ 761 (CCPA 1982); In re Vogel, 422 F.2d 438, 164 USPQ 619 (CCPA 
1970); and In re Thorington, 418 F.2d 528, 163 USPQ 644 (CCPA 1969). 

A timely filed terminal disclaimer in compliance with 37 CFR 1.321(c) or 1.321(d) may be used to 
overcome an actual or provisional rejection based on a nonstatutory double patenting ground provided the 
conflicting application or patent either is shown to be commonly owned with this application, or claims an 
invention made as a result of activities undertaken within the scope of a joint research agreement. 
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Effective January 1, 1994, a registered attorney or agent of record may sign a terminal disclaimer. A 
terminal disclaimer signed by the assignee must fully comply with 37 CFR 3.73(b). 

5. Claims 1 , 20, 39, 58, 81-84 are provisionally rejected on the ground of 
nonstatutory obviousness-type double patenting as being unpatentable over claims 1, 
16, 31-32 of copending Application No. 10/626,856. Although the conflicting claims are 
not identical, they are not patentably distinct from each other because the claims use 
determining steps that are clearly similar. For example in claim 1 of the instant 
application applicant states "determining source-identified training stories", in claim 1 of 
application 10/626,856 applicant states "determining a source-identified story corpus, 
each story associated with at least one event". In effect both claims state the same 
thing. Other steps in reminder of the claims follow the same reasoning. 

This is a provisional obviousness-type double patenting rejection because the 
conflicting claims have not in fact been patented. 

Claim Rejections - 35 USC § 101 

6. 35 U.S.C. 101 reads as follows: 

Whoever invents or discovers any new and useful process, machine, manufacture, or composition of 
matter, or any new and useful improvement thereof, may obtain a patent therefor, subject to the 
conditions and requirements of this title. 

7. Claims 39, 58 and 81-84 are rejected under 35 U.S.C. 101 because the claimed 
invention is directed to non-statutory subject matter. 
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Claims 39, 58 and 81-84 list computational steps in a program without tangible, 
useful, concrete result. The claims do not have any visible result or output. The steps of 
"determining" are missing real world result. Indicating doesn't actually have show or 
output the result of determination. There needs to be an outputting of the link or storing 
for further use. 

Claim 82 states the intended use by use of word "useable". To overcome this 
type of rejection, claims could be amended to recite definite functionality (i.e. executed" 
or "processed" or "to perform") 

Claim Rejections -35 (JSC §112 

8. The following is a quotation of the first paragraph of 35 U.S.C. 1 1 2: 

The specification shall contain a written description of the invention, and of the manner and process of 
making and using it, in such full, clear, concise, and exact terms as to enable any person skilled in the 
art to which it pertains, or with which it is most nearly connected, to make and use the same and shall 
set forth the best mode contemplated by the inventor of carrying out his invention. 

9. Claim 77 is rejected under 35 U.S.C. 112, first paragraph, as failing to comply 
with the enablement requirement. The claim(s) contains subject matter which was not 
described in the specification in such a way as to enable one skilled in the art to which it 
pertains, to make and/or use the invention. 

The claim uses the phrase "source mode" which is not sufficiently described in 
the specification. Paragraph 01 15 describes transformation of first mode, however it 
does not mention first source mode. The examiner is unsure whether there is a 
distinction between the two. 
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10. The.following is a quotation of the second paragraph of 35 U.S.C. 112: 

The specification shall conclude with one or more claims particularly pointing out and distinctly 
claiming the subject matter which the applicant regards as his invention. 

11. Claim 77 is rejected under 35 U.S.C. 112, second paragraph, as being indefinite 
for failing to particularly point out and distinctly claim the subject matter which applicant 
regards as the invention. 

Claim 77 recites "source mode" which the examiner is not sure what it stands for. 
Further explanation is required. 

Claim Rejections - 35 USC § 102 

12. The following is a quotation of the appropriate paragraphs of 35 U.S.C. 102 that 
form the basis for the rejections under this section made in this Office action: 

A person shall be entitled to a patent unless - 

(a) the invention was known or used by others in this country, or patented or described in a printed 
publication in this or a foreign country, before the invention thereof by the applicant for a patent. 

Claim 77-80 are rejected under 35 U.S.C. 102(a) as being anticipated by Brown, Ralf D. 
"Dynamic Stopwording for Story Link Detection", (hereafter referred as Brown) . 

As per claim 77 Brown is directed to a method of determining a stopword list 
comprising the steps of: 
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determining a source-identified training corpus of text information (page 1 , 
column 2, lines 26-29); 

determining a verified first source-mode transformation of the source-identified 
training corpus text from a first source mode to a second source mode (page 1 , column 
2, lines 26-29; page 1 column 2, lines 33-40, wherein the "transformation" would be the 
"single-pass incremental clustering method"); 

determining an un-verified second source-mode transformation of the source- 
identified training corpus text from a first source mode to a second source mode (page 

1, column 2, lines 17-18, wherein "un-verified" means any "story from a newswire"); 

determining at least one transformation errors associated with distribution 
differences between the first and second transformations and identified sources (page 

2, column 2, lines 4-6); 

determining and storing at least one source-specific transformation actions for 
the determined transformation errors in a memory (page 2, column 1, lines 2-6); and 

identifying and transforming transformation errors in other transformed source- 
identified texts based on the source-specific transformation actions in a memory (page 
2, column 1, lines 2-7). 

As per claim 78 Brown is directed to the first source mode is at least one of a text 
source, an optical character recognition source and an automatic speech recognition 
source (page 1 , column 2, lines 22-24). 
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As per claim 79 Brown is directed to the second source mode is at least one of a 
text source, an optical character recognition source and an automatic speech 
recognition source (page 1, column 2, lines 22-24; page 2, column 1, lines 6-8). 

As per claim 80 Brown is directed to wherein the source-specific transformation 
is at least one of a removal, a repair and a normalization transformation (page 2, 
column 1, lines 4-6). 

Claim Rejections - 35 USC § 103 

13. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

14. Claims 1-5,9-10, 14-,24, 28-29, 33-43, 47-48, 52-62, 66-67, 71-76 and 81-88 are 
rejected under 35 U.S.C. 103(a) as being unpatentable over Sundaresan et al. (US 
Patent 6,606,620 B1) in view of Pirolli et al. (US 5,835,905). 

As per claim 1 Sundaresan et al. is directed to a computer-implemented method 
of determining predictive models for a linked event detection system comprising the 
steps of: 

determining source-identified training stories (column 3, lines 16-17, wherein 
"stories" means "documents"); 
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determining link label information for the at least one story-pair (column 9, lines 

8-9); 

determining and storing at least one predictive model in the memory based on 
the inter-story similarity vectors and the link label information (column 10, lines 5-13); 
and 

Sundaresan et aL does not teach determining inter-story similarity vectors for at 
least one story-pair. 

Pirolli et al. teaches determining inter-story similarity vectors for at least one 
story-pair ( Pirolli et al. , column 7, lines 53-65, wherein "pages" could mean "stories"). 

It would have been obvious to one in of ordinary skill in the art at the time the 
invention was made to modify Sundaresan et al. by teachings of Pirolli et al. to include 
determining inter-story similarity vectors for at least one story-pair because it provides 
the similarity measure of documents. 

As per claim 2 Sundaresan et al. as modified is directed to a step of determining 
inter-story similarity vectors comprises the steps of: 

determining at least one inter-story similarity metric for the story-pairs 
( Sundaresan et al. , column 4, lines 9-25); 

and determining at least one source-pair statistics for the at least one story-pair 
( Sundaresan et al. , column 10, lines 15-17). 
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As per claim 3 Sundaresan et al. as modified is directed to a determining inter- 
story similarity vectors further comprise the step of normalizing the inter-story similarity 
metric based on the source-pair statistics ( Sundaresan et al , column 10, lines 17-22). 

As per claim 4 Sundaresan et al. as modified is directed to a determining inter- 
story similarity vectors further comprise the step of incrementally normalizing the inter- 
story similarity metric based on the source-pair statistics ( Sundaresan et al. , column 10, 
lines 16-22). 

As per claim 5 Sundaresan et al. as modified is directed to the inter-story 
similarity metric is normalized based on at least one of subtraction and division 
( Sundaresan et al. , column 8, lines 22-27). 

As per claim 9 Sundaresan et al. as modified is directed to a comprising the step 
of transforming the source-identified training stories ( Sundaresan et al. , column 1, line 
63, wherein the "training stories" are in English). 

As per claim 10 Sundaresan et al. as modified is directed to transforming the 
source-identified training stories is at least one of translating, transcribing and 
linguistically transforming (Sundaresan et aL , column 1, line 63; column 2, line 43, 
wherein the HTML and XML are in English, therefore translation will not be necessary). 
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As per claim 14 Sundaresan et al. as modified is directed to at least one inter- 
story similarity metric is normalized based on at least one of a source-pair identified 
similarity statistic ( Sundaresan et al. , column 10, lines 15-17). 

As per claim 15 Sundaresan et al. as modified is directed to at least one 
predictive model is at least one of: a classifier, a support vector machine, a decision tree 
and a Naive-Bayes classifier ( Sundaresan et al. , column 3, lines 13-14). 

As per claim 16 Sundaresan et al. as modified is directed to at least one of the 
source-pair similarity statistics are determined based on a source hierarchy 
( Sundaresan et al. , column 3, lines 50-51). 

As per claim 17 Sundaresan et al. as modified is directed to the source hierarchy 
is determined based on at least one source characteristic ( Sundaresan et al. , column 3, 
lines 61-65, wherein "characteristic" means "leaf). 

As per claim 18 Sundaresan et al. as modified is directed to the source 
characteristic is at least one of a language characteristic, an input mode characteristic, 
a genre characteristic, a source name characteristic and a transformation characteristic 
( Sundaresan et al. , column 3, lines 54-60, wherein "language characteristic" means how 
the words in a document are related to each other). 
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As per claim 19 Sundaresan et al. as modified is directed to the source-pair 
similarity statistic for a new source is determined based on at least one source 
characteristic of the new source ( Sundaresan et al. . column 3, lines 50-53, wherein 
each new source has different hierarchy). 

As per claim 20 Sundaresan et al. is directed to a linked event detection training 
system comprising: 

an input/output circuit (column 7, lines 34-35, wherein it is inherent for computer 
to have input/output device circuit); 

a memory (column 7, lines 34-35, wherein it is inherent for computer to have 
memory); 

a processor that receives source-identified training stories and associated link 
label information for at least one story-pair via the input/output circuit (column 7, lines 
34-35, wherein it is inherent for computer to have a processor); 

and a predictive model determining circuit that determines and stores at least 
one predictive model based on the inter-story similarity vectors and the link label 
information (column 10, lines 5-13) 

Sundaresan et al. does not teach an inter-story similarity vector determining 
circuit that determines an inter-story similarity vectors in memory for at least one story- 
pair of the source-identified stories. 

Pirolli et al. teaches an inter-story similarity vector determining circuit that 
determines an inter-story similarity vectors in memory for at least one story-pair of the 
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source-identified stories (Pirolli et aL column 7, lines 53-65, wherein "pages" could 
mean "stories"). 

It would have been obvious to one in of ordinary skill in the art at the time the 
invention was made to modify Sundaresan et al. by teachings of Pirolli et al. to include 
an inter-story similarity vector determining circuit that determines an inter-story similarity 
vectors in memory for at least one story-pair of the source-identified stories because it 
provides the similarity measure of documents. 

As per claim 21 Sundaresan et al. as modified is directed to the inter-story 
similarity vector determining circuit is comprised of: 

a similarity metric determining circuit that determines at least one inter-story 
similarity metric for the at least one story-pair ( Sundaresan et al. , column 4, lines 9-25); 

and a similarity statistics determining circuit that determines at least one source- 
pair statistic for the at least one story-pair (Sundaresan et al. , column 10, lines 15-17). 

As per claim 22 Sundaresan et al. as modified is directed to the inter-story 
similarity vector determining circuit normalizes the inter-story similarity metric based on 
the source-pair statistics ( Sundaresan et al. , column 10, lines 17-22). 



As per claim 23 Sundaresan et al. as modified is directed to the inter-story 
similarity vector determining circuit incrementally normalizes the inter-story similarity 
metric based on the source-pair statistics ( Sundaresan et al. , column 10, lines 16-22). 
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As per claim 24 Sundaresan et al. as modified is directed to at least one of the 
inter-story similarity metrics is normalized based on at least one of a subtraction and a 
division operation ( Sundaresan et al. , column 8, lines 22-27). 

As per claim 28 Sundaresan et al. as modified is directed to a comprising the 
step of transforming the source-identified training stories ( Sundaresan et al. , column 1, 
line 63, wherein the "training stories" are in English). 

As per claim 29 Sundaresan et al. as modified is directed to transforming the 
source-identified training stories is at least one of translating, transcribing and 
linguistically transforming (Sundaresan et al. , column 1, line 63; column 2, line 43, 
wherein the HTML and XML are in English therefore translation will not be necessary). 

As per claim 33 Sundaresan et al. as modified is directed to the at least one 
inter-story similarity metric is normalized based on at least one of a source-pair 
identified similarity statistic ( Sundaresan et al. , column 10, lines 15-17). 

As per claim 34 Sundaresan et al. as modified is directed to the at least one 
predictive model is at least one of: a classifier, a support vector machine, a decision tree 
and a Naive-Bayes classifier ( Sundaresan et al. , column 3, lines 13-14). 
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As per claim 35 Sundaresan et al. as modified is directed to the source-pair 
identified similarity statistic is determined based on a source hierarchy ( Sundaresan et 
aL, column 3, lines 50-51). 

As per claim 36 Sundaresan et al. as modified is directed to the source hierarchy 
is determined based on at least one of a source characteristic ( Sundaresan et al. , 
column 3, lines 61-65, wherein "characteristic" means "leaf). 

As per claim 37 Sundaresan et al. as modified is directed to the source 
characteristic is at least one of a language characteristic, an input mode characteristic, 
a genre characteristic, a source name characteristic and a transformation characteristic 
( Sundaresan et al. , column 3, lines 54-60, wherein "language characteristic" means how 
the words in a document are related to each other). 

As per claim 38 Sundaresan et al. as modified is directed to the source-pair 
similarity statistic for a new source is determined based on at least one source 
characteristics of the new source ( Sundaresan et al. , column 3, lines 50-53, wherein 
each new source has different hierarchy). 



As per claim 39 Sundaresan et al. is directed to a computer-implemented method 
of linked event detection comprising the steps of: 
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determining source-Identified training stories (column 3, lines 16-17, wherein 
"stories" means "documents"); 

determining at least one predictive model for link detection (column 10, lines 5- 

13); 

and determining a link between the story-pairs based on the predictive model 
and the inter-story similarity vector (column 10, lines 5-13, wherein sorting determines 
the link); and 

indicating the link (column 7, line 67; column 8, lines 1-4). 

Sundaresan et al. does not teach determining inter-story similarity vectors in a 
memory for the story-pairs of the source-verified stories. 

Pirolli et al. teaches determining inter-story similarity vectors in a memory for the 
story-pairs of the source-verified stories ( Pirolli et al. , column 7, lines 53-65, wherein 
"pages" could mean "stories"). 

It would have been obvious to one in of ordinary skill in the art at the time the 
invention was made to modify Sundaresan et al. by teachings of Pirolli et al. to include 
determining inter-story similarity vectors in a memory for the story-pairs of the source- 
verified stories because it provides the similarity measure of documents. 

As per claim 40 Sundaresan et al. as modified is directed to a step of determining 
inter-story similarity vectors comprises the steps of: 

determining at least one inter-story similarity metric for each story-pair 
( Sundaresan et al. . column 4, lines 9-25); 
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and determining source-pair statistics for the story-pairs ( Sundaresan et aL 
column 10, lines 15-17). 

As per claim 41 Sundaresan et aL as modified is directed to a determining inter- 
story similarity vectors further comprise the step of normalizing the inter-story similarity 
metric based on the source-pair statistics ( Sundaresan et aL , column 10, lines 17-22). 

As per claim 42 Sundaresan et al. as modified is directed to a determining inter- 
story similarity vectors further comprise the step of incrementally normalizing the inter- 
story similarity metric based on the source-pair statistics ( Sundaresan et aL , column 10, 
lines 16-22). 

As per claim 43 Sundaresan et al. as modified is directed to the inter-story 
similarity metric is normalized based on at least one of subtraction and division 
( Sundaresan et al. . column 8, lines 22-27). 

As per claim 47 Sundaresan et al. as modified is directed to a comprising the 
step of transforming the source-identified training stories (column 1, line 63, wherein the 
"training stories" are in English). 



As per claim 48 Sundaresan et al. as modified is directed to transforming the 
source-identified training stories is at least one of translating, transcribing and 
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linguistically transforming ( Sundaresan et al. , column 1, line 63; column 2, line 43, 
wherein the HTML and XML are in English therefore translation will not be necessary). 

As per claim 52 Sundaresan et al. as modified is directed to the at least one 
inter-story similarity metric is normalized based on at least one of a source-pair 
identified similarity statistic ( Sundaresan et al. , column 10, lines 15-17). 

As per claim 53 Sundaresan et al. as modified is directed to the at least one 
predictive model is at least one of: a classifier, a support vector machine and a decision 
tree, a Naive-Bayes-classifier ( Sundaresan et al. , column 8, lines 22-27). 

As per claim 54 Sundaresan et al. as modified is directed to the source-pair 
similarity statistic for a new source is determined based on at least one source 
characteristics of the new source ( Sundaresan et al. , column 3, lines 50-51). 

As per claim 55 Sundaresan et al. as modified is directed to the source hierarchy 
is determined based on at least one of a source characteristic ( Sundaresan et al , 
column 3, lines 61-65, wherein "characteristic" means "leaf). 

As per claim 56 Sundaresan et al. as modified is directed to the source 
characteristic is at least one of a language characteristic, an input mode characteristic, 
a genre characteristic, a source name characteristic and a transformation characteristic 
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( Sundaresan et aL column 3, lilies 54-60, wherein "language characteristic" means how 
the words in a document are related to each other). 

As per claim 57 Sundaresan et al. as modified is directed to the source-pair 
similarity statistic for a new source is determined based on at least one source 
characteristics of the new source ( Sundaresan et al. , column 3, lines 50-53, wherein 
each new source has different hierarchy). 

As per claim 58 Sundaresan et aL is directed to linked event detection system 
comprising: 

an input/output circuit (column 7, lines 34-35, wherein it is inherent for computer 
to have input/output device circuit); 

a memory (column 7, lines 34-35, wherein it is inherent for computer to have 
memory); 

a processor that receives source-identified training stories via the input/output 
circuit (column 7, lines 34-35, wherein it is inherent for computer to have processor); 

and a link determining circuit that determines and indicates links between story- 
pairs based on a predictive model in the memory and the inter-story similarity vectors 
(column 10, lines 5-13, wherein sorting determines the link; column 7, line 67; column 8, 
lines 1-4). 
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Sundaresan et al. does not teach an inter-story similarity vector determining 
circuit that determines inter-story similarity vectors in the memory for the story-pairs of 
the source-identified stories. 

Pirolli et al. teaches an inter-story similarity vector determining circuit that 
determines inter-story similarity vectors in the memory for the story-pairs of the source- 
identified stories (Pirolli et aL column 7, lines 53-65, wherein "pages" could mean 
"stories"). 

It would have been obvious to one in of ordinary skill in the art at the time the 
invention was made to modify Sundaresan et al. by teachings of Pirolli et al. to include 
an inter-story similarity vector determining circuit that determines inter-story similarity 
vectors in the memory for the story-pairs of the source-identified stories because it 
provides the similarity measure of documents. 

As per claim 59 Sundaresan et al. as modified is directed to the inter-story 
similarity vector determining circuit is comprised of: 

a similarity metric determining circuit that determines at least one inter-story 
similarity metric for the story-pairs ( Sundaresan et al. , column 4, lines 9-25); 

and a similarity statistics determining circuit that determines source-pair statistics 
for the story-pairs (column 10, lines 15-17). 
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As per claim 60 Sundaresan et al. as modified is directed to the inter-story 
similarity vector determining circuit normalizes the inter-story similarity metric based on 
the source-pair statistics ( Sundaresan et al. . column 10, lines 17-22). 

As per claim 61 Sundaresan et al. as modified is directed to the inter-story 
similarity vector determining circuit incrementally normalizes the inter-story similarity 
metric based on the source-pair statistics ( Sundaresan et al. . column 10, lines 16-22). 

As per claim 62 Sundaresan et al. as modified is directed to at least one of the 
inter-story similarity metrics is normalized based on at least one of a subtraction and a 
division operation ( Sundaresan et al. , column 8, lines 22-27). 

As per claim 66 Sundaresan et al. as modified is directed to a comprising the 
step of transforming the source-identified training stories (Sundaresan et al. . column 1, 
line 63, wherein the "training stories" are in English). 

As per claim 67 Sundaresan et al. as modified is directed to transforming the 
source-identified training stories is at least one of translating, transcribing and 
linguistically transforming ( Sundaresan et al. . column 1, line 63; column 2, line 43, 
wherein the HTML and XML are in English therefore translation will not be necessary). 
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As per claim 71 Sundaresan et al. as modified is directed to the at least one 
inter-story similarity metric is normalized based on at least one of a source-pair 
identified similarity statistic ( Sundaresan et al. , column 10, lines 15-17). 

As per claim 72 Sundaresan et al. as modified is directed to the predictive model 
is at least one of: a classifier, a support vector machine and a decision tree, a Naive- 
Bayes classifier ( Sundaresan et aL column 8, lines 22-27). 

As per claim 73 Sundaresan et al. as modified is directed to the source-pair 
identified similarity statistic is determined based on a source hierarchy ( Sundaresan et 
aL, column 3, lines 50-51). 

As per claim 74 Sundaresan et al. as modified is directed to the source hierarchy 
is determined based on at least one of a source characteristic ( Sundaresan et al. , 
column 3, lines 61-65, wherein "characteristic" means "leaf). 

As per claim 75 Sundaresan et al. as modified is directed to the source 
characteristic is at least one of a language characteristic, an input mode characteristic, 
a genre characteristic, a source name characteristic and a transformation characteristic 
( Sundaresan et al. , column 3, lines 54-60, wherein "language characteristic" means how 
the words in a document are related to each other). 
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As per claim 76 Sundaresan et al. as modified is directed to the source-pair 
similarity statistic for a new source is determined based on at least one source 
characteristics of the new source ( Sundaresan et al. , column 3, lines 50-53, wherein 
each new source has different hierarchy). 

As per claim 81 Sundaresan et al. is directed to computer readable storage 
medium comprising: computer readable program code embodied on the computer 
readable storage medium, the computer readable program code usable to program a 
computer to determine at least one predictive model for a linked event detection system 
comprising the steps of: 

determining source-identified training stories (column 3, lines 16-17, wherein 
"stories" means "documents"); 

determining link label information for the at least one story-pair (column 9, lines 

8-9); 

and determining at least one predictive model in the memory based on the inter- 
story similarity vector and the link label information (column 10, lines 5-13); 

Sundaresan et al. does not teach determining inter-story similarity vectors in a 
memory for at least one story-pair of the source-identified training stories 

Pirolli et al. teaches determining inter-story similarity vectors in a memory for at 
least one story-pair of the source-identified training stories ( Pirolli et al. , column 7, lines 
53-65, wherein "pages" could mean "stories"). 
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It would have been obvious to one in of ordinary skill in the art at the time the 
invention was made to modify Sundaresan et al. by teachings of Pirolli et al. to include 
determining inter-story similarity vectors in a memory for at least one story-pair of the 
source-identified training stories because it provides the similarity measure of 
documents. 

As per claim 82 Sundaresan et al. is directed to computer readable storage 
medium comprising: computer readable program code embodied on the computer 
readable storage medium, the computer readable program code usable to program a 
computer to determine at least one predictive model for a linked event detection system 
comprising: 

instructions to determine source-identified training stories (column 3, lines 16-17, 
wherein "stories" means "documents"); 

instructions to determine link label information for the at least one story-pair 
(column 9, lines 8-9); 

and instructions to determine at least one predictive model in the memory based 
on the inter-story similarity vector and the link label information (column 10, lines 5-13); 
and 

Sundaresan et al. does not teach instructions to determine inter-story similarity 
vectors in memory for at least one story-pair of the source-identified training stories. 
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Pirolli et al. teaches instructions to determine inter-story similarity vectors in 
memory for at least one story-pair of the source-identified training stories ( Pirolli et al. , 
column 7, lines 53-65, wherein "pages" could mean "stories"). 

It would have been obvious to one in of ordinary skill in the art at the time the 
invention was made to modify Sundaresan et al. by teachings of Pirolli et al. to include 
instructions to determine inter-story similarity vectors in memory for at least one story- 
pair of the source-identified training stories because it provides the similarity measure of 
documents. 

As per claim 83 Sundaresan et al. is directed to computer readable storage 
medium comprising: computer readable program code embodied on the computer 
readable storage medium, the computer readable program code executable to program 
a computer to detect linked events comprising the steps of: 

determining source-identified stories (column 3, lines 16-17, wherein "stories" 
means "documents"); 

determining at least one predictive model in the memory for link detection 
(column 9, lines 8-9); 

determining a link between story-pairs based on the at least one predictive model 
and the inter-story similarity vectors (column 10, lines 5-13); and 

indicating the link (column 7, line 67; column 8, lines 1-4). 

Sundaresan et al. does not teach determining inter-story similarity vectors in a 
memory for the at least one story-pair of the source-identified stories. 
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Pirolli et al. teaches determining inter-story similarity vectors in a memory for the 
at least one story-pair of the source-identified stories ( Pirolli et al. , column 7, lines 53- 
65, wherein "pages" could mean "stories"). 

It would have been obvious to one in of ordinary skill in the art at the time the 
invention was made to modify Sundaresan et al. by teachings of Pirolli et al. to include 
determining inter-story similarity vectors in a memory for the at least one story-pair of 
the source-identified stories because it provides the similarity measure of documents. 

As per claim 84 Sundaresan et al. is directed to computer readable storage 
medium comprising: computer readable program code embodied on the computer 
readable storage medium, the computer readable program code executable to program 
a computer to detect linked events comprising the steps of: 

instructions to determine source-identified stories (column 3, lines 16-17, wherein 
"stories" means "documents"); 

instructions to determine at least one predictive model in a memory for link 
detection (column 9, lines 8-9); 

instructions to determine a link between story-pairs based on the predictive 
model and the inter-story similarity vectors (column 10, lines 5-13); ); and 

instructions to indicate the link (column 7, line 67; column 8, lines 1-4). 

Sundaresan et al. does not teach instructions to determine inter-story similarity 
vectors in a memory for the at least one story-pair of the source-identified stories. 
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Pirolli et al. teaches instructions to determine inter-story similarity vectors in a 
memory for the at least one story-pair of the source-identified stories (Pirolli et al. , 
column 7, lines 53-65, wherein "pages" could mean "stories"). 

It would have been obvious to one in of ordinary skill in the art at the time the 
invention was made to modify Sundaresan et al. by teachings of Pirolli et al. to include 
instructions to determine inter-story similarity vectors in a memory for the at least one 
story-pair of the source-identified stories because it provides the similarity measure of 
documents. 

As per claims 85 and 86 Sundaresan et al. as modified is directed to determining 
at least one source-pair statistic for the at least one story-pair is based on at least one 
of a similarity metric and a statistic associated with the metric ( Sundaresan et al. . 
column 3, lines 25-29, wherein the statistical algorithm uses metric for the 
computations). 

As per claims 87 and 88 Sundaresan et al. as modified is directed to at least one 
of the predictive models is a trained predictive model ( Sundaresan et al. , column 10, 
lines 29-33, wherein the "trained predictive model" is determined by use of statistical 
model). 
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15. Claims 6-8, 25-27, 44-46, and 63-65 are rejected under 35 U.S.C. 103(a) as 
being unpatentable over Sundaresan et al. (US Patent 6,606,620 B1) in view of Pirolli et 
aL (US 5,835,905) and in further view Ganqe et al. (US 2004/006559 A1 ), 

As per claims 6, 25, 44 and 63 Sundaresan et al. as modified fails to teach the 
use of probability based metric and a Euclidean based similarity metric. 

Gange et aL teaches the use of Euclidean distance ( Gange et al. , page 3, 
paragraph 0045). 

It would have been obvious to one in of ordinary skill in the art at the time the 
invention was made to further modify Sundaresan et al. as modified by teachings of 
Gange et al. to include the use of Euclidean distance as it is metrics often used in the 
database field to compute distances between similar terms. 

As per claims 7, 26, 45 and 64 Sundaresan et al. as modified fails to teach the 
use of similarity metric is at least one of a Hellinger, a Tanimoto and a clarity distance 
based metric. 

Gange et al. teaches the use of Tanimoto coefficient ( Gange et al. , page 3, 
paragraph 0045). 

It would have been obvious to one in of ordinary skill in the art at the time the 
invention was made to further modify Sundaresan et al. as modified by teachings of 
Gange et al. to include the use of Tanimoto coefficient as it is metrics often used in the 
database field to compute distances between similar terms. 
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A per claims 8, 27, 46 and 65 Sundaresan et al. as modified fails to teach the use 
of inter-story similarity metric is a cosine-distance based metric. 

Gange et al. teaches the use of Cosine coefficient ( Gange et al. , page 3, 
paragraph 0045). 

It would have been obvious to one in of ordinary skill in the art at the time the 
invention was made to further modify Sundaresan et al. as modified by teachings of 
Gange et al. to include the use of Cosine coefficient as it is metrics often used in the 
database field to compute distances between similar terms. 

16. Claims 11-13, 30-32, 49-51 and 68-70 are rejected under 35 U.S.C. 103(a) as 
being unpatentable over Sundaresan et al. (US Patent 6,606,620 B1) in view of Pirolli et 
aL (US 5,835,905) and in further view Zhou (US 2004/0002849 A1 ). 

As per claims 1 1 , 30, 49 and 68 Sundaresan et al. as modified fails to teach the 
inter-story similarity metrics are based on terms in at least one source-identified term 
frequency-inverse story frequency models. 

Zhou teaches the use of frequency-inverse ( Zhou , page 3, column 2, paragraph 
0030, lines 9-11). 

It would have been obvious to one in of ordinary skill in the art at the time the 
invention was made to further modify Sundaresan et al. as modified by teachings of 
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Zhou to include the use of frequency-inverse because it predicts effective example of 
sentence retrieval as stated on page 1, column 1, paragraph 0005 of Zhou. 

As per claims 12, 37, 50 and 69 Sundaresan et al. as modified fails to teach the 
terms in source-identified term frequency-inverse story frequency models are based on 
language. 

Zhou teaches that the retrieved samples are to aid in writing or translation ( Zhou , 
page 3, paragraph 0030, lines 2-4, wherein writing or translating has basis in language). 

It would have been obvious to one in of ordinary skill in the art at the time the 
invention was made to further modify Sundaresan et al. as modified by teachings of 
Zhou to include the inverse-frequency based on language because term comparison 
includes terms of a language. 

As per claims 13, 32, 51 and 70 Sundaresan et al. as modified fails to teach 
determining terms comprises the steps: determining a reference language; and 
determining reference language and non-reference language terms. 

Zhou teaches the changing of sample terms from one mode to another ( Zhou , 
page 3, paragraph 0032). 

It would have been obvious to on of ordinary skill in the art at the time the 
invention was made to further modify Sundaresan et al. as modified by teachings of 
Zhou to include the determination of reference language since the correct translation 
requires the correct reference language. 
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Response to Arguments 

17. Applicant's arguments with respect to claims 1-88 have been considered but are 
moot in view of the new ground(s) of rejection. 

Conclusion 

18. Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Tomasz Ponikiewski whose telephone number is 
(571)272-1721. The examiner can normally be reached on 8:00-4:30. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, Jeffrey A. Gaffin can be reached on (571)272-4146. The fax phone number 
for the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Sen/ice Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 
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